FILE: 5312USF.RTF 

PERIPHERAL DEVICE INTERFACE CHIP CACHE AND DATA 
SYNCHRONIZATION METHOD 

CROSS-REFERENCE TO RELATED APPLICATION 
This application claims the priority benefit of Taiwan application serial no. 
89111825, filed June 16, 2000, and Taiwan application serial no. 87119245, filed 
Novenber20, 1998. 

BACKGROUND OF THE INVENTION 

Field of Invention 

The present invention relates to a cache system and a method of synchronization 
data transmission. More particularly, the present invention relates to a cache system in 
a peripheral device interface control chip cache for synchronization data 
communication. 

Description of Related Art 

Although data buffers are frequently employed inside a peripheral device 
interface control chip to process read-out data, cache memory functions are mostly 
absent. Therefore, whenever peripheral devices need to recall data from a memory 
unit, the peripheral device interface control chipset must execute a read command and 
retrieve the relevant data from the memory unit. 
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Fig. 1 is a conventional timing diagram showing a portion of the peripheral 
device bus signals during a session when data is read from an external device to a 
peripheral device. As shown in Fig. 1, two batches of data are transmitted through the 
peripheral device bus. The two batches of data are transmitted starting at clock cycle 
5 CLK1 and CLK20 respectively. 

Starting at clock cycle CLK1, a FRAME signal is activated to indicate the 
initialization of data transmission. In the meantime, destination address of the target 
device for receiving the transmission data is put on the AD signal lines of bus. In 
clock cycle CLK2, an IRDY signal is activated to indicate readiness of transmission 
10 data of the peripheral device. However, the target is not yet ready to receive the 
transmission data and the transmission has to wait until the target ready signal TRDY 
arrives at clock cycle CLK8. Therefore, actual transmission is carried out only at the 
start of clock cycle CLK9. The period starting from the activation of the IRDY signal 
to the point just before the activation of the TRDY signal is known as a latency period. 

15 In Fig. 1, the transmission of the second batch of data starts at clock cycle 

CLK20 When a FRAME Signal is activated. The second batch of data is similar to the 
first batch of data or differs from the first batch by a single line. In other words, the 
second batch of data may be within a 32-byte address. Hence, control and signal 
timing of the signal bus by the peripheral device interface controller is similar to the 

20 transmission of the first batch of data. 

Starting at clock cycle CLK20, a FRAME signal is activated to indicate the 

initialization of data transmission. In the meantime, destination address of the target 
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device for receiving the transmission data is put on the AD signal lines of the bus. In 
clock cycle CLK21, an IRDY signal is activated to indicate readiness of transmission 
data of the peripheral device. However, the target is not yet ready to receive the 
transmission data and the transmission has to wait until the target ready signal TRDY 
5 arrives at clock cycle CLK27. Therefore, actual transmission is carried out only at the 
start of clock cycle CLK28. 

In brief, in conventional data transmission between two peripheral devices 
through an interface, although two consecutive batches of data have the same target 
address or differ just by a single line, a latency period is created in each transmission 
10 period. The extra latency period will lead to a slow down of the peripheral device 
interface and a reduction in the efficiency of peripheral device bus and peripheral 
device. 

SUMMARY OF THE INVENTION 

Accordingly, one object of the present invention is to provide a peripheral 
15 device interface control chip having a cache system therein for synchronization data 
communication with external devices. The cache system inside the control chip is 
capable of reducing latency period when data are read by a peripheral device, so that 
utilization of the peripheral device and the peripheral device bus is increased. 
Furthermore, correctness of transmitted data is further ensured through a data 
20 synchronization method. 

To achieve these and other advantages and in accordance with the purpose of the 
invention, as embodied and broadly described herein, the invention provides a 



3 



FILE: 5312USF.RTF 



peripheral device interface control chip having a cache system in a computer system. 
The computer system includes a set of memory units, a central processing unit (CPU), a 
CPU bus, a peripheral device bus and at least one peripheral device. The cache system 
includes a data buffer and a peripheral device interface controller. The data buffer is 

5 located within the control chip for holding a data stream read from the memory so that 
data required by the peripheral device are provided. In addition, if the data stream is 
synchronous to the data in the corresponding address within the memory, the data 
stream is retained. When any one of the peripheral devices demands data already in 
the data stream, data within the data stream can be immediately provided by the data 

10 bufferso that latent period for retrieving the data stream from memory again is reduced. 

The peripheral device controller is also installed within the control chip. The 
controller is used for determining if the data stream includes data demanded by a 
particular peripheral device, determining if the data stream is synchronous to the data in 
the corresponding address, retrieving the data stream from memory and putting the data 
15 in a data buffer, and switching the state of that portion of the data buffer having data 
stream therein. 

This invention also provides a method of synchronization data transmission 
between a cache inside a peripheral device interface control chip and an external device. 

The method can be applied to a computer system having a set of memory unit, at least 

20 one central processing unit, a control chip, a peripheral device bus, a CPU bus and at 
least one peripheral device. The control chip includes a peripheral device interface 
controller and a data buffer. Furthermore, when data stream within memory is read 
into the data buffer, the data stream becomes a buffer data stream. 
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The data synchronization method includes the following steps. On 
initialization, the data buffer is set to an empty state. When the peripheral device 
interface controller reads a buffer data stream into the data buffer according to the 
requirement of a particular peripheral device, the buffered data portion of the data buffer 
5 that includes the buffer data stream is set to a clean-unaccessed state. If the peripheral 
device interface controller detects from the CPU bus a write or a read operation using 
the corresponding address when the said buffered data portion is in a clean-unaccessed 
state, the buffered data portion is set to a dirty-unaccessed state. 

In addition, If the peripheral device interface controller detects from the 
10 peripheral device bus a write operation using the corresponding address when the 
buffered data is in the clean-unaccessed state, the buffered data portion is set to a dirty- 
unaccessed state. 

If the particular peripheral device that demands the buffer data stream reads the 
buffer data stream from the buffered data portion when the buffered data portion is in 
15 the clean-unaccessed state, the buffered data portion is set to a clean-accessed state. 

If the particular peripheral device that demands the buffer data stream reads the 
buffer data stream from the buffered data portion when the buffered data portion is in a 
dirty-unaccessed state, the buffered data portion is set to an empty state. 

Furthermore, if the peripheral device interface controller detects from the CPU 
20 bus a read or a write operation using the corresponding address when the buffered data 
portion is in a clean-accessed state, the buffered data portion is set to an empty state. 
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If the peripheral device interface controller detects from the peripheral device 
bus a write operation using the corresponding address when the buffered data portion is 
in the clean-accessed state, the buffered data portion is set to an empty state. 

This invention also provides a peripheral device interface control chip having a 
5 cache system therein and a method of synchronization in data transmission with 
external devices. The method can be applied to a computer system having a set of 
memory unit, at least one central processing unit, a control chip, a peripheral device bus, 
a CPU bus and at least one peripheral device. The control chip includes a peripheral 
device interface controller and a data buffer. The central processing unit uses a 
10 MOESI protocol. Documents relating to the techniques and theories behind the 
operation of MOESI protocol can be found by contacting the following Internet address: 
"http://www.sun.com/microelectronics/datasheets/stpl030/10.html". When memory 
data stream within the memory is read into the central processing unit, the memory data 
stream becomes a cache data stream. On the other hand, when memory data stream is 
15 read into the buffer data, the data stream becomes a buffer data stream. 

The data synchronization method includes the following steps. First, if the data 
buffer executes a read operation from an address in memory that corresponds to the 
cache data stream when the cache data stream is in modified state, then the peripheral 
device interface controller will inform the central processing unit to set the cache data 
20 stream into an owner state. 
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If the data buffer executes a read operation from the corresponding address when 
the cache data stream is in an exclusive state, the peripheral device interface controller 
will inform the central processing unit to set the cache data stream into a shared state. 

In brief, this invention utilizes the data buffer inside a control chip to store read- 
5 out data as well as other data in neighboring addresses so that latency period of reading 
data by peripheral devices is greatly reduced. Hence, utilization of peripheral devices 
and peripheral device bus is increased correspondingly. Furthermore, by using a data 
synchronization method, correctness of transmitted data is ensured. 

It is to be understood that both the foregoing general description and the 
10 following detailed description are exemplary, and are intended to provide further 
explanation of the invention as claimed. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The accompanying drawings are included to provide a further understanding of 
the invention, and are incorporated in and constitute a part of this specification. The 
15 drawings illustrate embodiments of the invention and, together with the description, 
serve to explain the principles of the invention. In the drawings, 

Fig. 1 is a conventional timing diagram showing a portion of the bus signals 
during a session when two batches of data are read from memory to a peripheral 
device; 
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Fig. 2 is a block diagram showing the connections between different modules in 
a computer system according to one preferred embodiment of this invention; 

Fig. 3 is a timing diagram showing a portion of the bus signals during a session 
when two batches of data are read from memory to a peripheral device according to this 
5 invention; 

Fig. 4 is a state transition diagram showing data transition between different 
states inside a data buffer according to this invention; and 

Fig. 5 is a state transition diagram showing the transition of states for the data in 
a central processing unit operating under a MOESI protocol according to this invention. 

10 DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Reference will now be made in detail to the present preferred embodiments of 
the invention, examples of which are illustrated in the accompanying drawings. 
Wherever possible, the same reference numbers are used in the drawings and the 
description to refer to the same or like parts. 

15 The invention also allows a CPU interface to control the writing process, 

including receiving a write request and data from a CPU and then writing the data to a 
memory of a memory circuit. After the CPU interface receives a write request from 
the CPU, the CPU interface sends a dummy request to the memory control circuit to 
pre-charge and activate the designative memory page of the memory circuit before the 

20 data is sent to the memory circuit. Since the designative memory page is always pre- 

8 



FILE: 5312USF.RTF 

charged and activated while the data is received at the memory control circuit, the 
memory control circuit sends only a write commands to the memory for writing the data 
to the memory without further pre-charging and activating the designative memory page. 
Therefore, the total number of clock cycles required for processing a write request is 
5 shortened. 

Fig. 2 is a block diagram showing the connections between different modules in 
a computer system according to one preferred embodiment of this invention. As 
shown in Fig. 2, the computer system includes a memory unit 30, a control chip 10, a 
peripheral device 50, a central processing unit (CPU) 40, a peripheral device bus 15, a 
10 CPU bus 45, a memory bus 35, a peripheral device interface controller 20 and a data 
buffer 25. 

The data buffer 25 is installed within control chip 10 for holding data stream 
read from memory unit 30 so that data needed by peripheral device 50 are provided. 
Peripheral device interface controller 20 is also installed inside control chip 10 for 

15 detecting if the data stream within the data buffer 25 contains the data demanded by the 
peripheral device 50. In addition, peripheral device interface controller 20 also detects 
among the peripheral device bus 15 and the CPU bus 45 any action of the data on the 
corresponding address of the memory unit 30 to judge if the data stream is still valid or 
not. Peripheral device interface controller 20 will retain the data stream until other 

20 data stream needs to use this portion of area or the data stream is already not in 
synchrony with the correct data. 



9 



FILE: 5312USF.RTF 



Furthermore, peripheral device interface controller 20 is capable of retrieving a 
data stream from memory unit 30 according to a demand from peripheral device 50 and 
putting the data stream in data buffer 25. In the meantime, peripheral device interface 
controller 20 will issue a probe-hit-read signal to the central processing unit 40. 

5 Moreover, the aforementioned act of detecting action in corresponding address also 
includes accepting signals produced by peripheral device bus 15 during an operation of 
writing data to the corresponding address, accepting signals produced by CPU bus 45 
during a write operation and accepting signals produced by CPU bus 45 during a read 
operation. In addition, peripheral device interface controller 20 also controls state 

10 transition in the data buffer 25 according to the result of detection. 

In this embodiment, data buffer 25 includes eight lines with each line having a 
holding capacity of 32 bytes. Hence, overall holding capacity of data buffer 25 is 256 
bytes. In addition, every two lines are grouped together as a transmission block with 
each transmission block holding the data in a data read transmission. Since two lines 
15 are used to form a transmission block, the next batch of data is also read out besides the 
batch of data demanded by the peripheral device 50. In other words, altogether 64 
bytes are read out. 

Fig. 3 is a timing diagram showing a portion of the bus signals during a session 
when two batches of data are read from memory to a peripheral device according to this 
20 invention. As shown in Fig. 1 and Fig. 3, the clocking pulse for reading the first batch 
of data are identical. However, after reading the first batch of data, the peripheral 
device interface controller 20 reads in 64 bytes of data that also includes the first batch 
of data. All these data are stored in a transmission block within the data buffer 25. 
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Hence, when the peripheral device 50 needs to read the second batch of data and if the 
address of the second batch of data is identical to the first batch or the address is within 
the same 64-byte transmission block, latency of the second data read is only from clock 
cycle CLK21 to CLK22 as shown in Fig. 3. In other words, the system has a latency 
5 period of only one clock cycle instead of six clock cycles in a conventional system. 
Therefore, the system of this invention is capable of increasing peripheral device 
interface transmission rate considerably. 

Fig. 4 is a state transition diagram showing data transition between different 
states inside a data buffer 25 according to this invention. Data within the data buffer 
10 25 can be in one of four states, including: an empty state 400, a clean-unaccessed state 
410, a dirty-unaccessed state 430 and a clean-accessed state 420. 

Any one of the memory data streams in memory 30 read out by central 
processing unit 40 forms a cache data stream. The memory data stream is stored in 
data buffer 25 to form a buffer data stream. The memory data stream, the cache data 
15 stream and the buffer data stream all correspond to the same address in memory unit 30. 

The data synchronization method includes the following steps. On 
initialization, data buffer 25 is set to an empty state 400. When peripheral device 
interface controller 20 reads a memory data stream from memory 30 into data buffer 25 
according to the requirement of a particular peripheral device 50, the portion of data 
20 buffer 25 that stores the memory data stream, or in other words, the buffered data 
portion of the data buffer 25 that stores the buffer data stream is set to a clean- 
unaccessed state 410 via process 440. 
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When the said buffered data portion is in a clean-unaccessed state 410 and if 
peripheral device interface controller 20 detects from CPU bus 45 a write or a read 
operation using the corresponding address, or, if peripheral device interface controller 
20 detects from peripheral device bus 15 a write operation using the corresponding 
5 address, the buffered data portion is set to a dirty-unaccessed state 430 via process 450. 

When the buffered data portion is in a clean-unaccessed state 410 and if the 
particular peripheral device 50 that demands the buffer data stream reads the buffer data 
stream from data buffer 25, the buffered data portion is set by peripheral device 
interface controller 20 to a clean-accessed state 420 via process 460. 

10 When the buffered data portion is in a dirty-unaccessed state 430 and if the 

particular peripheral device 50 that demands the buffer data stream reads the buffer data 
stream from the data buffer 25, the buffered data portion is set to an empty state 400 via 
process 470. 

Furthermore, when the buffered data portion is in a clean-accessed state 420 and 
15 if peripheral device interface controller 20 detects from CPU bus 45 a read or a write 
operation using the corresponding address, or, if peripheral device interface controller 
20 detects from peripheral device bus 15 a write operation using the corresponding 
address, the buffered data portion is set to an empty state 400 via process 480. 

Fig. 5 is a state transition diagram showing the transition of states for the data in 
20 the cache memory of a central processing unit operating under a MOESI protocol 
according to this invention. As shown in Fig. 5, the MOESI protocol includes five 
states including: a modified state (M) 510, an owner state (O) 540, an exclusive state (E) 
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520, a shared state (S) 530 and an invalid state (I) 500. The MOESI protocol is a 
means of maintaining uniformity in data transmission between the cache memory inside 
a central processing unit and any external cache memory unit. 

When the cache memory is in an invalid state 500 and if data is read from the 
memory unit and the data is not present in other cache memory, the cache memory will 
transit from invalid state 500 to an exclusive state 520 (an I-E conversion). If data 
read from the memory unit is present in other cache memory, the cache memory will 
transit from invalid state 500 to a shared state 530 (an I-S conversion). If the cache 
memory generates a write miss signal, data will automatically read from the memory 
unit and the cache memory will transit from invalid state 500 to a modified state 510 (an 
I-M conversion). 

If the cache memory inside the central processing unit is in the exclusive state 
520 and the cache memory needs to modify the data, the cache memory will transit 
from the exclusive state 520 to a modified state 510 (an E-M conversion). If system 
demands the data in the exclusive state 520, the data will transit from the exclusive state 
520 to a shared state 530 (an E-S conversion). If the data need to be cleared or other 
data need to be stored in the storage area, the data will transit from the exclusive state 
520 to an invalid state 500 (an E-I conversion). 

Furthermore, if the cache memory inside the central processing unit is in a 
shared state 530 and data need to be stored, the cache memory will transit from the 
shared state 530 to a modified state 510 (a S-M conversion). If data need to be cleared 
for storing other data, or if copies of the data in other cache memory need to be 
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modified, the data will transit from the shared state 530 to an invalid state 500 (a S-I 
conversion). 

When the cache memory inside the central processing unit is in a modified state 
510 and if other cache memory issues a read signal for reading data from the internal 

5 cache memory, the data will transit from the modified state 5 10 to an owner state 540 (a 
M-O conversion). On the other hand, if data need to be cleared for the storage of other 
data or other cache memory needs to write the data, the data will transit from the 
modified state 510 to an invalid state 500 (an M-I conversion). In addition, if other 
cache memory needs to read the data, the data will transit from the modified state 510 

10 into a shared state 530 (an M-S conversion). 

Finally, if the cache memory inside the central processing unit is in the owner 
state 540 and if the data need to be written in, the data will transit from the owner state 
540 into a modified state 510 (an O-M conversion). 

The following is a description of probe-hit-read signal emitted by the peripheral 
15 device interface controller. The probe-hit-read signal is transmitted to the central 
processing unit when data are read from the memory unit to the data buffer. The 
probe-hit-read signal further includes the corresponding address for proper reading of 
data from the memory unit to the data buffer. 

Figs. 2, 4, and 5 together also illustrate another preferred embodiment of this 
20 invention. If the central processing unit operates using the said MOESI protocol and if 
the data within the cache memory are in an exclusive state 520, when a probe-hit-read 
signal sent from the peripheral device interface controller arrives, and if the address 
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included within the probe-hit-read signal is identical to the address of the data inside the 
cache memory, the data within the cache memory of the central processing unit will 
transit from the exclusive state 520 to a shared state 530 via an E-S conversion process 
560. 

5 Furthermore, if the central processing unit operates using the said MOESI 

protocol and if the data within the cache memory are in a modified state 510, and if the 
address included within the probe-hit-read signal is identical to the address of the data 
inside the cache memory, the data within the cache memory of the central processing 
unit will transit from the modified state 510 to an owner state 540 via a M-0 conversion 
10 process 550. 

In summary, the incorporation of a data buffer in the northbridge chipset of this 
invention is able to reduce latency period when data are read from a memory unit to a 
peripheral device. Hence, the utilization of both the peripheral device and the 
peripheral device bus is increased. Furthermore, data integrity is ensured through a 
15 data synchronization scheme. 

It will be apparent to those skilled in the art that various modifications and 
variations can be made to the structure of the present invention without departing from 
the scope or spirit of the invention. In view of the foregoing, it is intended that the 
present invention cover modifications and variations of this invention provided they fall 
20 within the scope of the following claims and their equivalents. 
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